# Global Feature Extraction
Vit Large Patch16 Siglip Gap 256.v2 Webli
Apache-2.0
A ViT image encoder based on SigLIP 2, employing global average pooling with the attention pooling head removed, specifically designed for image feature extraction.
Text-to-Image
Transformers

V
timm
95
0
Vit So400m Patch14 Siglip Gap 384.webli
Apache-2.0
Vision Transformer model based on SigLIP, utilizing global average pooling for image features
Image Classification
Transformers

V
timm
96
0
Featured Recommended AI Models